Overview of FIRE 2014 Track on Transliterated Search

نویسندگان

  • Monojit Choudhury
  • Gokul Chittaranjan
  • Parth Gupta
  • Amitava Das
چکیده

The Transliterated Search track has been organized for the second year in FIRE. The track has two subtasks. Subtask 1 on language labeling of words in code-mixed text fragments was conducted for 6 Indian languages: Bangla, Gujarati, Hindi, Malayalam, Tamil, Telugu, mixed with English. In Subtask 2 on retrieval of Hindi film lyrics, along with transliterated queries in Roman script, this year we also had Devanagari queries. A total of 54 runs were submitted from 18 teams, of which 35 runs for subtask 1 and 7 runs for subtask 2 has been evaluated. Performance of the runs is comparable to that of last year’s.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of FIRE-2015 Shared Task on Mixed Script Information Retrieval

The Transliterated Search track has been organized for the third year in FIRE-2015. The track had three subtasks. Subtask I was on language labeling of words in code-mixed text fragments; it was conducted for 8 Indian languages: Bangla, Gujarati, Hindi, Kannada, Malayalam, Marathi, Tamil, Telugu, mixed with English. Subtask II was on ad-hoc retrieval of Hindi film lyrics, movie reviews and astr...

متن کامل

Overview of the Mixed Script Information Retrieval (MSIR) at FIRE-2016

The shared task on Mixed Script Information Retrieval (MSIR) was organized for the fourth year in FIRE-2016. The track had two subtasks. Subtask-1 was on question classification where questions were in code mixed Bengali-English and Bengali was written in transliterated Roman script. Subtask-2 was on ad-hoc retrieval of Hindi film song lyrics, movie reviews and astrology documents, where both t...

متن کامل

LIGA and Syllabification Approach for Language Identification and Back Transliteration : Shared Task Report by DAIICT

This paper aims to address the solution for the Subtask 1 of Shared Task on transliterated search,a task in FIRE ’14. The task addresses the problem of data containing English words and transliterated words of Indian languages in English.The task calls for language identification and subsequent back transliteration into the native Indian scripts.The system proposed herewith implements Language ...

متن کامل

A Relevance feedback based approach for mixed script transliterated text search: Shared Task report by BIT Mesra, India

This paper describes the experiments carried out as part of the participation in FIRE-2014 Transliterated Search Shared task. We participated in subtask-2 and submitted two results generated by systems based on relevant feedback approach. Given a collection of documents in mixed script, the task is to retrieve relevant documents using queries in either script. The spelling variation between dif...

متن کامل

Language identification for transliterated forms of Indian language queries

Language identification has a number of applications in natural language processing. N Gram analysis has been the traditional method for language identification. In this paper, we discuss the methods and results from our participation in the Shared Task on Transliterated Search track at Forum for Information Retrieval Evaluation, 2014. We describe a method that leverages the phonetical properti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014